On the value of intra-motif dependencies of human insulator protein CTCF
Abstract
The modularity of the ZOOPS model allows an arbitrary motif model with parameters $\Theta_m$ to be combined with an arbitrary flanking model with parameters $\Theta_f$. The binary latent variable $u_i$ indicates whether the $i$-th sequence contains ($u_i = 1$) or does not contain ($u_i = 0$) a binding site. We model the position of the binding site of width $W$ by the latent variable $v_i \in \{1, \dots, L_i - W + 1\}$, where $L_i$ is the length of the $i$-th sequence. The binding site may occur on either strand, so we introduce a third latent variable $s_i \in \{F, R\}$, indicating whether the binding site occurs on the forward strand ($s_i = F$) or on the reverse complement strand ($s_i = R$). We denote the latent variables of the complete data set by $\vec{u} = (u_1, \dots, u_N)$, $\vec{v} = (v_1, \dots, v_N)$, and $\vec{s} = (s_1, \dots, s_N)$. The conditional likelihood of $X$ given the latent variables $(\vec{u}, \vec{v}, \vec{s})$ and parameters $\Theta_m, \Theta_f$ is given by
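The formula itself is not reproduced in this excerpt. As a rough illustration of how the three latent variables interact, the following sketch evaluates the conditional log-likelihood of a single sequence. It rests on assumptions that are not taken from the paper: the motif model is a simple position weight matrix (`pwm`), the flanking model is an i.i.d. background distribution (`background`), and all function names are hypothetical.

```python
import math

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[b] for b in reversed(seq))


def log_flank(seq, background):
    """Log-probability of seq under an i.i.d. flanking model (assumption)."""
    return sum(math.log(background[b]) for b in seq)


def log_motif(site, pwm):
    """Log-probability of a site under a PWM motif model (assumption)."""
    return sum(math.log(pwm[j][b]) for j, b in enumerate(site))


def conditional_log_likelihood(x, u, v, s, pwm, background):
    """log P(x | u, v, s) for one sequence in a ZOOPS-style model.

    u: 1 if the sequence contains a binding site, 0 otherwise
    v: 1-based start position of the site, in {1, ..., L - W + 1}
    s: "F" for the forward strand, "R" for the reverse complement strand
    """
    if u == 0:
        # No binding site: the whole sequence comes from the flanking model.
        return log_flank(x, background)

    width = len(pwm)
    if not 1 <= v <= len(x) - width + 1:
        raise ValueError("v must lie in {1, ..., L - W + 1}")

    start = v - 1  # convert the 1-based position to a 0-based index
    site = x[start:start + width]
    if s == "R":
        # A site on the reverse strand is scored as its reverse complement.
        site = reverse_complement(site)

    return (
        log_flank(x[:start], background)
        + log_motif(site, pwm)
        + log_flank(x[start + width:], background)
    )


# Toy parameters, chosen only for illustration.
background = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}
pwm = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
]
forward = conditional_log_likelihood("GACT", 1, 2, "F", pwm, background)
reverse = conditional_log_likelihood("GGTT", 1, 2, "R", pwm, background)
no_site = conditional_log_likelihood("GACT", 0, 1, "F", pwm, background)
```

Under independence between sequences, the log-likelihood of the full data set would be the sum of these per-sequence terms. Swapping in a motif model with intra-motif dependencies would only change `log_motif`, which is the modularity the paragraph above describes.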
Similar resources
On the Value of Intra-Motif Dependencies of Human Insulator Protein CTCF
The binding affinity of DNA-binding proteins such as transcription factors is mainly determined by the base composition of the corresponding binding site on the DNA strand. Most proteins do not bind only a single sequence, but rather a set of sequences, which may be modeled by a sequence motif. Algorithms for de novo motif discovery differ in their promoter models, learning approaches, and othe...
Increased Intragenic IGF2 Methylation is Associated with Repression of Insulator Activity and Elevated Expression in Serous Ovarian Carcinoma
Overexpression of insulin-like growth factor-II (IGF2) is a prominent characteristic of many epithelial ovarian malignancies. IGF2 imprinting and transcription are regulated in part through DNA methylation, which in turn regulates binding of the insulator protein CTCF within the IGF2/H19 imprint center. We have shown that IGF2 overexpression in ovarian cancer is associated with hypermethylation...
Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human Genome
Insulator elements affect gene expression by preventing the spread of heterochromatin and restricting transcriptional enhancers from activation of unrelated promoters. In vertebrates, insulator function requires association with the CCCTC-binding factor (CTCF), a protein that recognizes long and diverse nucleotide sequences. While insulators are critical in gene regulation, only a few have be...
Insulators as mediators of intra- and inter-chromosomal interactions: a common evolutionary theme
Insulator elements mediate intra- and inter-chromosomal interactions. The insulator protein CCCTC-binding factor (CTCF) is important for insulator function in several animals but a report in BMC Molecular Biology shows that Caenorhabditis elegans, yeast and plants lack CTCF. Alternative proteins may have a similar function in these organisms.
Systematic discovery of regulatory motifs in conserved regions of the human genome, including thousands of CTCF insulator sites.
Conserved noncoding elements (CNEs) constitute the majority of sequences under purifying selection in the human genome, yet their function remains largely unknown. Experimental evidence suggests that many of these elements play regulatory roles, but little is known about regulatory motifs contained within them. Here we describe a systematic approach to discover and characterize regulatory motif...